Figure mining for biomedical research
Authors
Abstract
MOTIVATION: Figures in biomedical articles contain valuable information that is difficult to reach without specialized tools. Currently, no search engine can retrieve specific figure types. RESULTS: This study describes a retrieval method that combines principles from image understanding, text mining and optical character recognition (OCR) to retrieve conceptually defined figure types. A search engine was developed to retrieve tables and figure types to aid computational and experimental research. AVAILABILITY: http://iossifovlab.cshl.edu/figurome/.
Related articles
Are figure legends sufficient? Evaluating the contribution of associated text to biomedical figure comprehension
BACKGROUND Biomedical scientists need to access figures to validate research facts and to formulate or to test novel research hypotheses. However, figures are difficult to comprehend without associated text (e.g., figure legend and other reference text). We are developing automated systems to extract the relevant explanatory information along with figures extracted from full text articles. Such...
Classification of Figures in Biomedical Literature toward a Figure Finding System
As biomedical full-text papers are becoming more available in digitized form on-line, there is a need for tools to mine information from all parts in the papers. Notably, since figures and their legends/captions in biomedical papers provide important information about research outcomes, mining techniques targeting them have attracted a great deal of attention. However, even a simple-sounding ta...
DeTEXT: A Database for Evaluating Text Extraction from Biomedical Literature Figures
Hundreds of millions of figures are available in biomedical literature, representing important biomedical experimental evidence. Since text is a rich source of information in figures, automatically extracting such text may assist in the task of mining figure information. A high-quality ground truth standard can greatly facilitate the development of an automated system. This article describes De...
Integrating image data into biomedical text categorization
Categorization of biomedical articles is a central task for supporting various curation efforts. It can also form the basis for effective biomedical text mining. Automatic text classification in the biomedical domain is thus an active research area. Contests organized by the KDD Cup (2002) and the TREC Genomics track (since 2003) defined several annotation tasks that involved document classific...
Europe PMC: Quick tour
What is Europe PMC? Europe PMC [2] is a global, free, biomedical literature repository, providing access to worldwide life sciences articles, books, patents and clinical guidelines. The resource currently contains over 32 million abstracts and more than 4 million full-text articles (see Figure 1). A subset of the full-text information corpus is the open-access literature that can be downloaded ...
Journal: Bioinformatics
Volume 25, Issue 16
Publication date: 2009